Cluster Analysis on High-Dimensional Data: A Comparison of Density-based Clustering Algorithms
نویسندگان
چکیده
The effectiveness and efficiency of the existing cluster analysis methods are limited, especially when the referred data has high dimensions or when the clusters within the data are not well-separated and having different densities, sizes and shapes. Density-based clustering algorithms have been proven able to discovered clusters with those characteristics. Previous researchers which explored density-based clustering algorithms focused on the analyzing the parameters essential for creating meaningful spatial clusters. The aim of this paper is to provide a comparative study of three well know density-based clustering algorithms including DBSCAN, DENCLUE and LTKC. The merits of them were evaluated of their ability to cluster several high-dimensional artificial data. We concluded that each density-based data clustering algorithm has their individual merits for highdimensional data. However, further research is needed in the application of the techniques to analyze other high-dimensional data, to permit a comprehensive evaluation of their respective strengths and limitations as powerful cluster analysis methods.
منابع مشابه
High-Dimensional Unsupervised Active Learning Method
In this work, a hierarchical ensemble of projected clustering algorithm for high-dimensional data is proposed. The basic concept of the algorithm is based on the active learning method (ALM) which is a fuzzy learning scheme, inspired by some behavioral features of human brain functionality. High-dimensional unsupervised active learning method (HUALM) is a clustering algorithm which blurs the da...
متن کاملAssessment of the Performance of Clustering Algorithms in the Extraction of Similar Trajectories
In recent years, the tremendous and increasing growth of spatial trajectory data and the necessity of processing and extraction of useful information and meaningful patterns have led to the fact that many researchers have been attracted to the field of spatio-temporal trajectory clustering. The process and analysis of these trajectories have resulted in the extraction of useful information whic...
متن کاملClustering for High Dimensional Data: Density based Subspace Clustering Algorithms
Finding clusters in high dimensional data is a challenging task as the high dimensional data comprises hundreds of attributes. Subspace clustering is an evolving methodology which, instead of finding clusters in the entire feature space, it aims at finding clusters in various overlapping or non-overlapping subspaces of the high dimensional dataset. Density based subspace clustering algorithms t...
متن کاملAn E cient Approach to Clustering in Large Multimedia Databaseswith
Several clustering algorithms can be applied to clustering in large multimedia databases. The eeectiveness and eeciency of the existing algorithms, however, is somewhat limited, since clustering in multimedia databases requires clustering high-dimensional feature vectors and since multimedia databases often contain large amounts of noise. In this paper , we therefore introduce a new algorithm t...
متن کاملA Multi-Objective Approach to Fuzzy Clustering using ITLBO Algorithm
Data clustering is one of the most important areas of research in data mining and knowledge discovery. Recent research in this area has shown that the best clustering results can be achieved using multi-objective methods. In other words, assuming more than one criterion as objective functions for clustering data can measurably increase the quality of clustering. In this study, a model with two ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013